Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 2159 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 168.8 KiB |
| Average record size in memory | 80.1 B |
Variable types
| Numeric | 10 |
|---|
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
SBP is highly correlated with DBP | High correlation |
DBP is highly correlated with SBP and 1 other fields | High correlation |
HR is highly correlated with DBP | High correlation |
SBP is highly correlated with DBP | High correlation |
DBP is highly correlated with SBP and 1 other fields | High correlation |
HR is highly correlated with DBP | High correlation |
Gage is highly correlated with HR | High correlation |
SBP is highly correlated with DBP | High correlation |
DBP is highly correlated with SBP and 1 other fields | High correlation |
HR is highly correlated with Gage and 2 other fields | High correlation |
RR is highly correlated with HR | High correlation |
Parity has 279 (12.9%) zeros | Zeros |
Totalrisk has 60 (2.8%) zeros | Zeros |
Reproduction
| Analysis started | 2022-02-27 16:47:42.700897 |
|---|---|
| Analysis finished | 2022-02-27 16:48:19.261202 |
| Duration | 36.56 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
Age
Real number (ℝ≥0)
| Distinct | 219 |
|---|---|
| Distinct (%) | 10.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.6488189 |
| Minimum | 15 |
|---|---|
| Maximum | 47.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.0 KiB |
Quantile statistics
| Minimum | 15 |
|---|---|
| 5-th percentile | 21 |
| Q1 | 25 |
| median | 30 |
| Q3 | 33.7 |
| 95-th percentile | 38.1 |
| Maximum | 47.5 |
| Range | 32.5 |
| Interquartile range (IQR) | 8.7 |
Descriptive statistics
| Standard deviation | 5.426592474 |
|---|---|
| Coefficient of variation (CV) | 0.1830289595 |
| Kurtosis | -0.6275456604 |
| Mean | 29.6488189 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 0.03710818114 |
| Sum | 64011.8 |
| Variance | 29.44790588 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 25 | 141 | 6.5% |
| 30 | 106 | 4.9% |
| 22 | 104 | 4.8% |
| 31 | 100 | 4.6% |
| 28 | 99 | 4.6% |
| 29 | 97 | 4.5% |
| 26 | 94 | 4.4% |
| 27 | 92 | 4.3% |
| 33 | 89 | 4.1% |
| 21 | 86 | 4.0% |
| Other values (209) | 1151 |
| Value | Count | Frequency (%) |
| 15 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 8 | 0.4% |
| 18 | 15 | 0.7% |
| 18.1 | 1 | < 0.1% |
| 19 | 17 | 0.8% |
| 20 | 12 | 0.6% |
| 21 | 86 | |
| 21.5 | 1 | < 0.1% |
| 21.8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 47.5 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 44.4 | 1 | < 0.1% |
| 44.3 | 1 | < 0.1% |
| 44 | 2 | |
| 43 | 3 | |
| 42.9 | 1 | < 0.1% |
| 42.6 | 2 | |
| 42.5 | 1 | < 0.1% |
| 42 | 2 |
| Distinct | 300 |
|---|---|
| Distinct (%) | 13.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.5132339 |
| Minimum | 3 |
|---|---|
| Maximum | 43 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.0 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 12 |
| Q1 | 28 |
| median | 29 |
| Q3 | 34 |
| 95-th percentile | 40 |
| Maximum | 43 |
| Range | 40 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 7.81044186 |
|---|---|
| Coefficient of variation (CV) | 0.2646420208 |
| Kurtosis | 1.234766717 |
| Mean | 29.5132339 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.9695730472 |
| Sum | 63719.072 |
| Variance | 61.00300205 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 40 | 145 | 6.7% |
| 28 | 127 | 5.9% |
| 34 | 114 | 5.3% |
| 39 | 100 | 4.6% |
| 28.29 | 82 | 3.8% |
| 32 | 82 | 3.8% |
| 28.43 | 79 | 3.7% |
| 33 | 77 | 3.6% |
| 28.14 | 76 | 3.5% |
| 30 | 73 | 3.4% |
| Other values (290) | 1204 |
| Value | Count | Frequency (%) |
| 3 | 1 | < 0.1% |
| 4 | 2 | 0.1% |
| 5 | 4 | 0.2% |
| 6 | 19 | |
| 6.2 | 1 | < 0.1% |
| 6.5 | 1 | < 0.1% |
| 6.6 | 1 | < 0.1% |
| 7 | 15 | |
| 7.3 | 1 | < 0.1% |
| 7.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 42 | 15 | 0.7% |
| 41.29 | 1 | < 0.1% |
| 41 | 71 | |
| 40 | 145 | |
| 39.57 | 2 | 0.1% |
| 39 | 100 | |
| 38.9 | 1 | < 0.1% |
| 38.86 | 2 | 0.1% |
| 38.71 | 2 | 0.1% |
BMI
Real number (ℝ≥0)
| Distinct | 1087 |
|---|---|
| Distinct (%) | 50.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.80223529 |
| Minimum | 15.73 |
|---|---|
| Maximum | 67.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.0 KiB |
Quantile statistics
| Minimum | 15.73 |
|---|---|
| 5-th percentile | 19.3994 |
| Q1 | 22.1 |
| median | 25.21 |
| Q3 | 30.029 |
| 95-th percentile | 39.4 |
| Maximum | 67.1 |
| Range | 51.37 |
| Interquartile range (IQR) | 7.929 |
Descriptive statistics
| Standard deviation | 6.450755113 |
|---|---|
| Coefficient of variation (CV) | 0.2406797434 |
| Kurtosis | 2.331661607 |
| Mean | 26.80223529 |
| Median Absolute Deviation (MAD) | 3.59 |
| Skewness | 1.315603529 |
| Sum | 57866.026 |
| Variance | 41.61224152 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 31.6 | 12 | 0.6% |
| 24.22 | 10 | 0.5% |
| 21.3 | 10 | 0.5% |
| 28.7 | 9 | 0.4% |
| 21.62 | 9 | 0.4% |
| 32.4 | 9 | 0.4% |
| 30.8 | 9 | 0.4% |
| 20.83 | 9 | 0.4% |
| 32 | 9 | 0.4% |
| 33.2 | 8 | 0.4% |
| Other values (1077) | 2065 |
| Value | Count | Frequency (%) |
| 15.73 | 1 | |
| 15.81 | 1 | |
| 16.16 | 1 | |
| 16.33 | 1 | |
| 16.59 | 1 | |
| 16.61 | 1 | |
| 16.69 | 1 | |
| 16.77 | 1 | |
| 16.98 | 1 | |
| 17.04 | 1 |
| Value | Count | Frequency (%) |
| 67.1 | 1 | |
| 59.4 | 1 | |
| 57.3 | 1 | |
| 56.07 | 1 | |
| 55.36 | 1 | |
| 55 | 1 | |
| 53.4 | 1 | |
| 53.2 | 1 | |
| 52.9 | 1 | |
| 52.3 | 2 |
| Distinct | 461 |
|---|---|
| Distinct (%) | 21.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 116.5582856 |
| Minimum | 44 |
|---|---|
| Maximum | 199 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.0 KiB |
Quantile statistics
| Minimum | 44 |
|---|---|
| 5-th percentile | 89 |
| Q1 | 105 |
| median | 114.3 |
| Q3 | 126 |
| 95-th percentile | 153 |
| Maximum | 199 |
| Range | 155 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 19.53289841 |
|---|---|
| Coefficient of variation (CV) | 0.1675805225 |
| Kurtosis | 2.101843075 |
| Mean | 116.5582856 |
| Median Absolute Deviation (MAD) | 10.7 |
| Skewness | 0.9117280779 |
| Sum | 251649.3387 |
| Variance | 381.5341205 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 120 | 54 | 2.5% |
| 109 | 52 | 2.4% |
| 110 | 51 | 2.4% |
| 107 | 48 | 2.2% |
| 111 | 47 | 2.2% |
| 112 | 45 | 2.1% |
| 113 | 42 | 1.9% |
| 114 | 42 | 1.9% |
| 108 | 40 | 1.9% |
| 105 | 39 | 1.8% |
| Other values (451) | 1699 |
| Value | Count | Frequency (%) |
| 44 | 1 | < 0.1% |
| 56 | 1 | < 0.1% |
| 68 | 2 | |
| 69.56 | 1 | < 0.1% |
| 71 | 4 | |
| 72 | 1 | < 0.1% |
| 73 | 3 | |
| 74 | 4 | |
| 75 | 2 | |
| 75.8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 199 | 1 | |
| 198 | 1 | |
| 197 | 2 | |
| 196 | 1 | |
| 195 | 1 | |
| 193 | 2 | |
| 191 | 1 | |
| 190 | 1 | |
| 189 | 2 | |
| 188 | 1 |
| Distinct | 409 |
|---|---|
| Distinct (%) | 18.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71.23946596 |
| Minimum | 10 |
|---|---|
| Maximum | 122 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.0 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 54 |
| Q1 | 63 |
| median | 70 |
| Q3 | 78.8 |
| 95-th percentile | 92 |
| Maximum | 122 |
| Range | 112 |
| Interquartile range (IQR) | 15.8 |
Descriptive statistics
| Standard deviation | 11.93508321 |
|---|---|
| Coefficient of variation (CV) | 0.1675347092 |
| Kurtosis | 0.9979576917 |
| Mean | 71.23946596 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.3940799302 |
| Sum | 153806.007 |
| Variance | 142.4462113 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 70 | 124 | 5.7% |
| 64 | 81 | 3.8% |
| 74 | 78 | 3.6% |
| 68 | 69 | 3.2% |
| 60 | 69 | 3.2% |
| 80 | 67 | 3.1% |
| 62 | 67 | 3.1% |
| 72 | 66 | 3.1% |
| 66 | 65 | 3.0% |
| 65 | 59 | 2.7% |
| Other values (399) | 1414 |
| Value | Count | Frequency (%) |
| 10 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 30 | 2 | 0.1% |
| 34.95 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
| 39.7 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 40.53 | 1 | < 0.1% |
| 44 | 5 | |
| 44.2 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 122 | 1 | < 0.1% |
| 120 | 1 | < 0.1% |
| 116 | 1 | < 0.1% |
| 110 | 13 | |
| 108 | 1 | < 0.1% |
| 104.3 | 1 | < 0.1% |
| 103.3 | 1 | < 0.1% |
| 103 | 1 | < 0.1% |
| 102 | 2 | 0.1% |
| 100.83 | 1 | < 0.1% |
| Distinct | 1146 |
|---|---|
| Distinct (%) | 53.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 82.82566744 |
| Minimum | 53.1 |
|---|---|
| Maximum | 105.37 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.0 KiB |
Quantile statistics
| Minimum | 53.1 |
|---|---|
| 5-th percentile | 72.0686 |
| Q1 | 78.402 |
| median | 83.008 |
| Q3 | 88.42 |
| 95-th percentile | 91.8508 |
| Maximum | 105.37 |
| Range | 52.27 |
| Interquartile range (IQR) | 10.018 |
Descriptive statistics
| Standard deviation | 6.27127829 |
|---|---|
| Coefficient of variation (CV) | 0.07571660434 |
| Kurtosis | -0.1556874156 |
| Mean | 82.82566744 |
| Median Absolute Deviation (MAD) | 4.918 |
| Skewness | -0.3076240387 |
| Sum | 178820.616 |
| Variance | 39.32893139 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 89.206 | 63 | 2.9% |
| 90.874 | 42 | 1.9% |
| 89.674 | 29 | 1.3% |
| 90.498 | 29 | 1.3% |
| 92.31 | 28 | 1.3% |
| 83.008 | 18 | 0.8% |
| 90.304 | 16 | 0.7% |
| 89.618 | 15 | 0.7% |
| 90.352 | 14 | 0.6% |
| 89.574 | 13 | 0.6% |
| Other values (1136) | 1892 |
| Value | Count | Frequency (%) |
| 53.1 | 1 | |
| 61.57 | 1 | |
| 63.67 | 1 | |
| 63.82 | 1 | |
| 63.98 | 1 | |
| 64.76 | 1 | |
| 65.37 | 1 | |
| 65.982 | 1 | |
| 66.17 | 1 | |
| 66.38 | 1 |
| Value | Count | Frequency (%) |
| 105.37 | 1 | |
| 102.92 | 1 | |
| 101.48 | 1 | |
| 99.21 | 1 | |
| 98 | 1 | |
| 96.46 | 1 | |
| 96.23 | 1 | |
| 95.75 | 1 | |
| 95.64 | 1 | |
| 95.406 | 1 |
| Distinct | 991 |
|---|---|
| Distinct (%) | 45.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.75278184 |
| Minimum | 11.77 |
|---|---|
| Maximum | 25.64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.0 KiB |
Quantile statistics
| Minimum | 11.77 |
|---|---|
| 5-th percentile | 16.7572 |
| Q1 | 17.998 |
| median | 18.69 |
| Q3 | 19.52 |
| 95-th percentile | 20.9612 |
| Maximum | 25.64 |
| Range | 13.87 |
| Interquartile range (IQR) | 1.522 |
Descriptive statistics
| Standard deviation | 1.36514202 |
|---|---|
| Coefficient of variation (CV) | 0.0727967739 |
| Kurtosis | 2.268556758 |
| Mean | 18.75278184 |
| Median Absolute Deviation (MAD) | 0.758 |
| Skewness | -0.06571126201 |
| Sum | 40487.256 |
| Variance | 1.863612734 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 18.18 | 69 | 3.2% |
| 18.058 | 42 | 1.9% |
| 19.392 | 32 | 1.5% |
| 18.622 | 29 | 1.3% |
| 17.936 | 29 | 1.3% |
| 19.762 | 18 | 0.8% |
| 18.072 | 17 | 0.8% |
| 16.86 | 15 | 0.7% |
| 18.118 | 14 | 0.6% |
| 17.92 | 13 | 0.6% |
| Other values (981) | 1881 |
| Value | Count | Frequency (%) |
| 11.77 | 1 | |
| 12.09 | 1 | |
| 12.27 | 1 | |
| 12.41 | 1 | |
| 12.64 | 1 | |
| 13.2 | 1 | |
| 13.37 | 1 | |
| 13.4 | 1 | |
| 13.59 | 1 | |
| 14.21 | 1 |
| Value | Count | Frequency (%) |
| 25.64 | 1 | |
| 24.45 | 1 | |
| 23.97 | 1 | |
| 23.9 | 1 | |
| 23.35 | 1 | |
| 23.22 | 1 | |
| 23.19 | 1 | |
| 23.18 | 1 | |
| 22.95 | 1 | |
| 22.818 | 2 |
| Distinct | 27 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.774339972 |
| Minimum | 0 |
|---|---|
| Maximum | 12 |
| Zeros | 279 |
| Zeros (%) | 12.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 12 |
| Range | 12 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.529720763 |
|---|---|
| Coefficient of variation (CV) | 0.8621350966 |
| Kurtosis | 5.799033421 |
| Mean | 1.774339972 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.96338727 |
| Sum | 3830.8 |
| Variance | 2.340045614 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=27)
| Value | Count | Frequency (%) |
| 1 | 811 | |
| 2 | 497 | |
| 0 | 279 | 12.9% |
| 3 | 210 | 9.7% |
| 4 | 103 | 4.8% |
| 5 | 51 | 2.4% |
| 6 | 25 | 1.2% |
| 1.4 | 23 | 1.1% |
| 1.2 | 21 | 1.0% |
| 7 | 19 | 0.9% |
| Other values (17) | 120 | 5.6% |
| Value | Count | Frequency (%) |
| 0 | 279 | 12.9% |
| 0.2 | 4 | 0.2% |
| 0.4 | 5 | 0.2% |
| 0.6 | 3 | 0.1% |
| 0.8 | 12 | 0.6% |
| 1 | 811 | |
| 1.2 | 21 | 1.0% |
| 1.4 | 23 | 1.1% |
| 1.6 | 17 | 0.8% |
| 1.8 | 18 | 0.8% |
| Value | Count | Frequency (%) |
| 12 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 10 | 2 | 0.1% |
| 9 | 9 | 0.4% |
| 8 | 12 | 0.6% |
| 7 | 19 | 0.9% |
| 6 | 25 | 1.2% |
| 5 | 51 | |
| 4 | 103 | |
| 3.6 | 2 | 0.1% |
GDM
Real number (ℝ≥0)
| Distinct | 265 |
|---|---|
| Distinct (%) | 12.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 77.76462251 |
| Minimum | 48.6 |
|---|---|
| Maximum | 147.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.0 KiB |
Quantile statistics
| Minimum | 48.6 |
|---|---|
| 5-th percentile | 70.2 |
| Q1 | 74.16 |
| median | 77.4 |
| Q3 | 80.36 |
| 95-th percentile | 87.12 |
| Maximum | 147.6 |
| Range | 99 |
| Interquartile range (IQR) | 6.2 |
Descriptive statistics
| Standard deviation | 6.147772605 |
|---|---|
| Coefficient of variation (CV) | 0.07905616213 |
| Kurtosis | 18.02880582 |
| Mean | 77.76462251 |
| Median Absolute Deviation (MAD) | 3.24 |
| Skewness | 2.309258145 |
| Sum | 167893.82 |
| Variance | 37.795108 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 73.8 | 114 | 5.3% |
| 75.6 | 109 | 5.0% |
| 79.2 | 104 | 4.8% |
| 77.4 | 96 | 4.4% |
| 72 | 72 | 3.3% |
| 81 | 69 | 3.2% |
| 76.32 | 53 | 2.5% |
| 78.12 | 48 | 2.2% |
| 82.8 | 46 | 2.1% |
| 77.04 | 44 | 2.0% |
| Other values (255) | 1404 |
| Value | Count | Frequency (%) |
| 48.6 | 1 | < 0.1% |
| 59.4 | 1 | < 0.1% |
| 60 | 1 | < 0.1% |
| 61.2 | 2 | 0.1% |
| 62 | 2 | 0.1% |
| 63 | 6 | |
| 64 | 1 | < 0.1% |
| 64.8 | 10 | |
| 65 | 1 | < 0.1% |
| 66 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 147.6 | 1 | |
| 140.4 | 1 | |
| 129.6 | 1 | |
| 115.2 | 1 | |
| 114 | 1 | |
| 111.6 | 1 | |
| 109.8 | 1 | |
| 109 | 1 | |
| 106.2 | 1 | |
| 104.4 | 2 |
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.111162575 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 60 |
| Zeros (%) | 2.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 17.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.022300233 |
|---|---|
| Coefficient of variation (CV) | 0.4842356744 |
| Kurtosis | -0.3832023649 |
| Mean | 2.111162575 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.3259905181 |
| Sum | 4558 |
| Variance | 1.045097767 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=6)
| Value | Count | Frequency (%) |
| 2 | 802 | |
| 1 | 587 | |
| 3 | 490 | |
| 4 | 203 | 9.4% |
| 0 | 60 | 2.8% |
| 5 | 17 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 60 | 2.8% |
| 1 | 587 | |
| 2 | 802 | |
| 3 | 490 | |
| 4 | 203 | 9.4% |
| 5 | 17 | 0.8% |
| Value | Count | Frequency (%) |
| 5 | 17 | 0.8% |
| 4 | 203 | 9.4% |
| 3 | 490 | |
| 2 | 802 | |
| 1 | 587 | |
| 0 | 60 | 2.8% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| Age | Gage | BMI | SBP | DBP | HR | RR | Parity | GDM | Totalrisk | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 38.0 | 12.43 | 21.88 | 109.05 | 73.06 | 81.88 | 20.28 | 2.2 | 79.80 | 2 |
| 1 | 37.0 | 11.57 | 25.51 | 112.25 | 72.03 | 77.89 | 23.97 | 3.6 | 77.80 | 3 |
| 2 | 25.0 | 12.86 | 22.58 | 88.48 | 51.17 | 77.52 | 20.48 | 2.0 | 77.32 | 3 |
| 3 | 34.0 | 11.86 | 21.30 | 98.62 | 58.19 | 89.44 | 15.87 | 2.0 | 77.00 | 1 |
| 4 | 32.0 | 11.57 | 24.91 | 113.20 | 61.46 | 73.10 | 22.48 | 1.6 | 77.20 | 1 |
| 5 | 25.0 | 12.43 | 23.53 | 101.48 | 61.39 | 61.57 | 18.22 | 1.6 | 83.40 | 2 |
| 6 | 33.0 | 12.14 | 18.94 | 94.98 | 60.56 | 86.78 | 20.15 | 1.6 | 74.88 | 1 |
| 7 | 28.0 | 13.14 | 24.58 | 116.98 | 74.56 | 74.57 | 19.65 | 1.6 | 76.20 | 1 |
| 8 | 30.0 | 12.57 | 22.66 | 98.79 | 60.18 | 76.18 | 19.79 | 1.8 | 79.80 | 1 |
| 9 | 33.0 | 12.00 | 23.15 | 105.46 | 63.85 | 65.37 | 18.83 | 3.4 | 76.32 | 1 |
Last rows
| Age | Gage | BMI | SBP | DBP | HR | RR | Parity | GDM | Totalrisk | |
|---|---|---|---|---|---|---|---|---|---|---|
| 2149 | 25.0 | 34.0 | 26.0 | 108.0 | 62.0 | 83.054 | 17.394 | 3.0 | 78.12 | 3 |
| 2150 | 26.0 | 29.0 | 43.3 | 181.0 | 88.0 | 90.498 | 17.936 | 0.0 | 80.36 | 3 |
| 2151 | 37.0 | 33.0 | 36.5 | 128.0 | 88.0 | 90.754 | 16.406 | 1.0 | 78.48 | 3 |
| 2152 | 39.0 | 28.0 | 32.0 | 137.0 | 90.0 | 90.086 | 17.302 | 7.0 | 80.28 | 3 |
| 2153 | 26.0 | 32.0 | 37.5 | 106.0 | 76.0 | 80.718 | 18.292 | 1.0 | 80.28 | 2 |
| 2154 | 22.0 | 25.0 | 28.4 | 88.0 | 58.0 | 83.560 | 19.116 | 2.0 | 76.32 | 2 |
| 2155 | 33.0 | 28.0 | 22.5 | 89.0 | 62.0 | 88.284 | 16.228 | 1.0 | 75.96 | 3 |
| 2156 | 27.0 | 31.0 | 36.8 | 122.0 | 70.0 | 80.294 | 20.378 | 2.0 | 90.36 | 2 |
| 2157 | 30.0 | 34.0 | 26.2 | 121.0 | 72.0 | 86.616 | 20.182 | 5.0 | 75.60 | 2 |
| 2158 | 23.0 | 25.0 | 30.4 | 93.0 | 70.0 | 86.582 | 19.246 | 1.0 | 77.40 | 2 |
Most frequently occurring
| Age | Gage | BMI | SBP | DBP | HR | RR | Parity | GDM | Totalrisk | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 22.0 | 31.0 | 31.96 | 110.0 | 70.0 | 81.45 | 16.862 | 1.0 | 75.6 | 2 | 2 |